Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 52318 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.0 MiB |
| Average record size in memory | 120.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 4 |
| Boolean | 1 |
Alerts
| Alert | Type |
|---|---|
| age_of_first_emp is highly overall correlated with cb_person_cred_hist_length and 2 other fields | High correlation |
| cb_person_cred_hist_length is highly overall correlated with age_of_first_emp and 1 other field | High correlation |
| cb_person_default_on_file is highly overall correlated with loan_grade and 1 other field | High correlation |
| loan_amnt is highly overall correlated with loan_percent_income | High correlation |
| loan_grade is highly overall correlated with cb_person_default_on_file and 1 other field | High correlation |
| loan_int_rate is highly overall correlated with cb_person_default_on_file and 1 other field | High correlation |
| loan_percent_income is highly overall correlated with loan_amnt | High correlation |
| person_age is highly overall correlated with age_of_first_emp and 1 other field | High correlation |
| person_emp_length is highly overall correlated with age_of_first_emp | High correlation |
| id is uniformly distributed | Uniform |
| id has unique values | Unique |
| person_emp_length has 6749 (12.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-11 17:37:31.180811 |
|---|---|
| Analysis finished | 2025-03-11 17:37:42.576852 |
| Duration | 11.4 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
id
Real number (ℝ)
Uniform  Unique 
| Distinct | 52318 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29328.758 |
| Minimum | 0 |
| Maximum | 58644 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2929.85 |
| Q1 | 14629.25 |
| median | 29314.5 |
| Q3 | 44022.75 |
| 95-th percentile | 55744.15 |
| Maximum | 58644 |
| Range | 58644 |
| Interquartile range (IQR) | 29393.5 |
Descriptive statistics
| Standard deviation | 16958.414 |
|---|---|
| Coefficient of variation (CV) | 0.57821793 |
| Kurtosis | -1.2024556 |
| Mean | 29328.758 |
| Median Absolute Deviation (MAD) | 14696 |
| Skewness | 9.3031589 × 10⁻⁵ |
| Sum | 1.534422 × 10⁹ |
| Variance | 2.875878 × 10⁸ |
| Monotonicity | Strictly increasing |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% |
| 39133 | 1 | < 0.1% |
| 39122 | 1 | < 0.1% |
| 39123 | 1 | < 0.1% |
| 39124 | 1 | < 0.1% |
| 39125 | 1 | < 0.1% |
| 39127 | 1 | < 0.1% |
| 39128 | 1 | < 0.1% |
| 39129 | 1 | < 0.1% |
| 39130 | 1 | < 0.1% |
| Other values (52308) | 52308 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 11 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 58644 | 1 | |
| 58643 | 1 | |
| 58642 | 1 | |
| 58641 | 1 | |
| 58639 | 1 | |
| 58638 | 1 | |
| 58637 | 1 | |
| 58636 | 1 | |
| 58635 | 1 | |
| 58634 | 1 |
person_age
Real number (ℝ)
High correlation 
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.542051 |
| Minimum | 20 |
| Maximum | 84 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 23 |
| median | 26 |
| Q3 | 30 |
| 95-th percentile | 39 |
| Maximum | 84 |
| Range | 64 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.0092298 |
|---|---|
| Coefficient of variation (CV) | 0.21818382 |
| Kurtosis | 5.406357 |
| Mean | 27.542051 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.9107078 |
| Sum | 1440945 |
| Variance | 36.110843 |
| Monotonicity | Not monotonic |
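Several of the figures above can be recomputed from the others. The following is a minimal sketch using only values copied from the person_age tables; the variable names are illustrative and are not part of the report.

```python
# Values copied from the person_age statistics tables.
minimum, q1, q3, maximum = 20, 23, 30, 84
mean, std = 27.542051, 6.0092298

value_range = maximum - minimum  # reported as Range
iqr = q3 - q1                    # reported as Interquartile range (IQR)
cv = std / mean                  # reported as Coefficient of variation (CV)
variance = std ** 2              # reported as Variance

print(value_range, iqr, round(cv, 6), round(variance, 4))
```

The results should agree with the reported Range, IQR, CV, and Variance up to rounding of the inputs.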
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 23 | 6914 | |
| 22 | 6282 | |
| 24 | 5698 | |
| 25 | 4503 | 8.6% |
| 27 | 4024 | 7.7% |
| 26 | 3451 | 6.6% |
| 28 | 3304 | 6.3% |
| 29 | 2920 | 5.6% |
| 30 | 2095 | 4.0% |
| 31 | 1705 | 3.3% |
| Other values (41) | 11422 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 8 | < 0.1% |
| 21 | 1600 | 3.1% |
| 22 | 6282 | |
| 23 | 6914 | |
| 24 | 5698 | |
| 25 | 4503 | |
| 26 | 3451 | |
| 27 | 4024 | |
| 28 | 3304 | |
| 29 | 2920 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 84 | 2 | < 0.1% |
| 80 | 2 | < 0.1% |
| 73 | 1 | < 0.1% |
| 70 | 10 | |
| 69 | 6 | |
| 66 | 9 | |
| 65 | 11 | |
| 64 | 9 | |
| 62 | 6 | |
| 61 | 13 |
person_income
Real number (ℝ)
| Distinct | 2376 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64076.099 |
| Minimum | 9600 |
| Maximum | 1900000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 9600 |
|---|---|
| 5-th percentile | 28800 |
| Q1 | 42000 |
| median | 58195 |
| Q3 | 75531 |
| 95-th percentile | 120000 |
| Maximum | 1900000 |
| Range | 1890400 |
| Interquartile range (IQR) | 33531 |
Descriptive statistics
| Standard deviation | 35301.29 |
|---|---|
| Coefficient of variation (CV) | 0.55092758 |
| Kurtosis | 217.82996 |
| Mean | 64076.099 |
| Median Absolute Deviation (MAD) | 16805 |
| Skewness | 7.3712001 |
| Sum | 3.3523333 × 10⁹ |
| Variance | 1.2461811 × 10⁹ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 60000 | 3904 | 7.5% |
| 50000 | 2845 | 5.4% |
| 30000 | 2059 | 3.9% |
| 70000 | 1750 | 3.3% |
| 75000 | 1583 | 3.0% |
| 40000 | 1521 | 2.9% |
| 45000 | 1489 | 2.8% |
| 65000 | 1402 | 2.7% |
| 90000 | 1268 | 2.4% |
| 48000 | 1139 | 2.2% |
| Other values (2366) | 33358 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 9600 | 8 | < 0.1% |
| 10140 | 1 | < 0.1% |
| 12000 | 26 | |
| 12360 | 1 | < 0.1% |
| 12500 | 1 | < 0.1% |
| 12600 | 1 | < 0.1% |
| 12996 | 1 | < 0.1% |
| 13200 | 6 | < 0.1% |
| 14000 | 3 | < 0.1% |
| 14400 | 44 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1900000 | 1 | < 0.1% |
| 1200000 | 1 | < 0.1% |
| 900000 | 4 | |
| 780000 | 3 | |
| 762000 | 2 | |
| 741600 | 1 | < 0.1% |
| 700000 | 1 | < 0.1% |
| 636000 | 1 | < 0.1% |
| 600000 | 3 | |
| 564000 | 1 | < 0.1% |
person_home_ownership
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 817.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.645189 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | OWN |
| 3rd row | OWN |
| 4th row | RENT |
| 5th row | RENT |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| RENT | 27274 | 52.1% |
| MORTGAGE | 22191 | 42.4% |
| OWN | 2772 | 5.3% |
| OTHER | 81 | 0.2% |
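Each Frequency (%) value is the row count divided by the number of observations. The sketch below recomputes the person_home_ownership percentages from the counts in the table and the 52318 observations reported in the overview; rounding to one decimal place is an assumption based on how the report prints its values.

```python
n_observations = 52318  # "Number of observations" from the overview

# Counts copied from the Common Values table for person_home_ownership.
counts = {"RENT": 27274, "MORTGAGE": 22191, "OWN": 2772, "OTHER": 81}

# With no missing cells, the category counts should cover every row.
covers_all_rows = sum(counts.values()) == n_observations

# Percentage per category, rounded to one decimal place (assumed display precision).
frequency = {
    value: round(100 * count / n_observations, 1)
    for value, count in counts.items()
}

for value, pct in frequency.items():
    print(f"{value}: {pct}%")
```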
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| R | 49546 | 16.8% |
| E | 49546 | 16.8% |
| T | 49546 | 16.8% |
| G | 44382 | 15.0% |
| N | 30046 | 10.2% |
| O | 25044 | 8.5% |
| M | 22191 | 7.5% |
| A | 22191 | 7.5% |
| W | 2772 | 0.9% |
| H | 81 | < 0.1% |
person_emp_length
Real number (ℝ)
High correlation  Zeros 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.6890363 |
| Minimum | 0 |
| Maximum | 39 |
| Zeros | 6749 |
| Zeros (%) | 12.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 12 |
| Maximum | 39 |
| Range | 39 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.8749001 |
|---|---|
| Coefficient of variation (CV) | 0.82637451 |
| Kurtosis | 2.0111745 |
| Mean | 4.6890363 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.1703373 |
| Sum | 245321 |
| Variance | 15.01485 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6749 | |
| 2 | 6482 | |
| 3 | 5779 | |
| 5 | 5273 | |
| 4 | 4899 | |
| 1 | 4622 | |
| 6 | 4371 | |
| 7 | 3822 | |
| 8 | 2673 | 5.1% |
| 9 | 2053 | 3.9% |
| Other values (23) | 5595 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6749 | |
| 1 | 4622 | |
| 2 | 6482 | |
| 3 | 5779 | |
| 4 | 4899 | |
| 5 | 5273 | |
| 6 | 4371 | |
| 7 | 3822 | |
| 8 | 2673 | 5.1% |
| 9 | 2053 | 3.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 39 | 1 | < 0.1% |
| 31 | 4 | < 0.1% |
| 30 | 2 | < 0.1% |
| 29 | 4 | < 0.1% |
| 28 | 3 | < 0.1% |
| 27 | 5 | < 0.1% |
| 26 | 8 | |
| 25 | 7 | |
| 24 | 13 | |
| 23 | 10 |
loan_intent
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 817.5 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 10.019592 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EDUCATION |
|---|---|
| 2nd row | MEDICAL |
| 3rd row | PERSONAL |
| 4th row | VENTURE |
| 5th row | MEDICAL |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| EDUCATION | 11031 | 21.1% |
| MEDICAL | 9521 | 18.2% |
| PERSONAL | 9003 | 17.2% |
| VENTURE | 8953 | 17.1% |
| DEBTCONSOLIDATION | 8217 | 15.7% |
| HOMEIMPROVEMENT | 5593 | 10.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| E | 72457 | 13.8% |
| O | 55871 | 10.7% |
| N | 51014 | 9.7% |
| I | 42579 | 8.1% |
| T | 42011 | 8.0% |
| A | 37772 | 7.2% |
| D | 36986 | 7.1% |
| C | 28769 | 5.5% |
| L | 26741 | 5.1% |
| M | 26300 | 5.0% |
| Other values (7) | 103705 | 19.8% |
loan_grade
Categorical
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 817.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | C |
| 3rd row | A |
| 4th row | B |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| A | 18936 | 36.2% |
| B | 18373 | 35.1% |
| C | 9877 | 18.9% |
| D | 4196 | 8.0% |
| E | 799 | 1.5% |
| F | 109 | 0.2% |
| G | 28 | 0.1% |
loan_amnt
Real number (ℝ)
High correlation 
| Distinct | 499 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9046.9164 |
| Minimum | 500 |
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2400 |
| Q1 | 5000 |
| median | 8000 |
| Q3 | 12000 |
| 95-th percentile | 20000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 7000 |
Descriptive statistics
| Standard deviation | 5448.1013 |
|---|---|
| Coefficient of variation (CV) | 0.60220534 |
| Kurtosis | 1.8019155 |
| Mean | 9046.9164 |
| Median Absolute Deviation (MAD) | 3000 |
| Skewness | 1.2084345 |
| Sum | 4.7331657 × 10⁸ |
| Variance | 29681808 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10000 | 6668 | 12.7% |
| 5000 | 4723 | 9.0% |
| 6000 | 4316 | 8.2% |
| 12000 | 4031 | 7.7% |
| 8000 | 3090 | 5.9% |
| 15000 | 3026 | 5.8% |
| 4000 | 2286 | 4.4% |
| 3000 | 2085 | 4.0% |
| 7000 | 1937 | 3.7% |
| 20000 | 1581 | 3.0% |
| Other values (489) | 18575 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 500 | 1 | < 0.1% |
| 700 | 1 | < 0.1% |
| 900 | 1 | < 0.1% |
| 1000 | 361 | |
| 1050 | 2 | < 0.1% |
| 1200 | 152 | |
| 1225 | 2 | < 0.1% |
| 1250 | 2 | < 0.1% |
| 1275 | 1 | < 0.1% |
| 1300 | 5 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 35000 | 119 | |
| 32000 | 1 | < 0.1% |
| 31000 | 1 | < 0.1% |
| 30750 | 1 | < 0.1% |
| 30000 | 83 | |
| 29800 | 2 | < 0.1% |
| 29100 | 1 | < 0.1% |
| 28000 | 43 | 0.1% |
| 27575 | 1 | < 0.1% |
| 27500 | 1 | < 0.1% |
loan_int_rate
Real number (ℝ)
High correlation 
| Distinct | 351 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.623311 |
| Minimum | 5.42 |
| Maximum | 23.22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 5.42 |
|---|---|
| 5-th percentile | 6.03 |
| Q1 | 7.88 |
| median | 10.74 |
| Q3 | 12.87 |
| 95-th percentile | 15.65 |
| Maximum | 23.22 |
| Range | 17.8 |
| Interquartile range (IQR) | 4.99 |
Descriptive statistics
| Standard deviation | 3.0031014 |
|---|---|
| Coefficient of variation (CV) | 0.28268977 |
| Kurtosis | -0.72406184 |
| Mean | 10.623311 |
| Median Absolute Deviation (MAD) | 2.74 |
| Skewness | 0.19732536 |
| Sum | 555790.39 |
| Variance | 9.018618 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10.99 | 1991 | 3.8% |
| 7.51 | 1976 | 3.8% |
| 7.88 | 1594 | 3.0% |
| 7.49 | 1449 | 2.8% |
| 13.49 | 1293 | 2.5% |
| 11.49 | 1200 | 2.3% |
| 7.9 | 1163 | 2.2% |
| 5.42 | 1019 | 1.9% |
| 11.71 | 981 | 1.9% |
| 6.03 | 978 | 1.9% |
| Other values (341) | 38674 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5.42 | 1019 | |
| 5.43 | 1 | < 0.1% |
| 5.79 | 731 | |
| 5.99 | 504 | |
| 6 | 4 | < 0.1% |
| 6.03 | 978 | |
| 6.05 | 1 | < 0.1% |
| 6.17 | 322 | 0.6% |
| 6.39 | 97 | 0.2% |
| 6.42 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 23.22 | 1 | < 0.1% |
| 22.11 | 1 | < 0.1% |
| 22.06 | 1 | < 0.1% |
| 21.74 | 4 | |
| 21.64 | 1 | < 0.1% |
| 21.36 | 6 | |
| 21.21 | 4 | |
| 20.89 | 4 | |
| 20.86 | 2 | < 0.1% |
| 20.8 | 1 | < 0.1% |
loan_percent_income
Real number (ℝ)
High correlation 
| Distinct | 60 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.15574009 |
| Minimum | 0 |
| Maximum | 0.83 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.04 |
| Q1 | 0.09 |
| median | 0.14 |
| Q3 | 0.2 |
| 95-th percentile | 0.33 |
| Maximum | 0.83 |
| Range | 0.83 |
| Interquartile range (IQR) | 0.11 |
Descriptive statistics
| Standard deviation | 0.089372179 |
|---|---|
| Coefficient of variation (CV) | 0.57385468 |
| Kurtosis | 0.70904537 |
| Mean | 0.15574009 |
| Median Absolute Deviation (MAD) | 0.06 |
| Skewness | 0.92871906 |
| Sum | 8148.01 |
| Variance | 0.0079873864 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.1 | 3070 | 5.9% |
| 0.08 | 2674 | 5.1% |
| 0.17 | 2537 | 4.8% |
| 0.11 | 2514 | 4.8% |
| 0.13 | 2463 | 4.7% |
| 0.09 | 2438 | 4.7% |
| 0.12 | 2359 | 4.5% |
| 0.07 | 2347 | 4.5% |
| 0.06 | 2294 | 4.4% |
| 0.14 | 2272 | 4.3% |
| Other values (50) | 27350 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% |
| 0.01 | 110 | 0.2% |
| 0.02 | 407 | 0.8% |
| 0.03 | 1009 | 1.9% |
| 0.04 | 1593 | |
| 0.05 | 2016 | |
| 0.06 | 2294 | |
| 0.07 | 2347 | |
| 0.08 | 2674 | |
| 0.09 | 2438 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.83 | 1 | < 0.1% |
| 0.63 | 1 | < 0.1% |
| 0.59 | 1 | < 0.1% |
| 0.56 | 1 | < 0.1% |
| 0.55 | 1 | < 0.1% |
| 0.54 | 1 | < 0.1% |
| 0.53 | 2 | < 0.1% |
| 0.52 | 4 | < 0.1% |
| 0.51 | 11 | < 0.1% |
| 0.5 | 65 |
cb_person_default_on_file
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 459.8 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 44727 | 85.5% |
| True | 7591 | 14.5% |
cb_person_cred_hist_length
Real number (ℝ)
High correlation 
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8141366 |
| Minimum | 2 |
| Maximum | 30 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 14 |
| Maximum | 30 |
| Range | 28 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.0249918 |
|---|---|
| Coefficient of variation (CV) | 0.69227679 |
| Kurtosis | 3.4727067 |
| Mean | 5.8141366 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.6160129 |
| Sum | 304184 |
| Variance | 16.200559 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 9590 | |
| 2 | 9455 | |
| 4 | 9418 | |
| 9 | 3124 | 6.0% |
| 8 | 3097 | 5.9% |
| 7 | 3055 | 5.8% |
| 6 | 3034 | 5.8% |
| 5 | 2989 | 5.7% |
| 10 | 2986 | 5.7% |
| 14 | 812 | 1.6% |
| Other values (19) | 4758 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 9455 | |
| 3 | 9590 | |
| 4 | 9418 | |
| 5 | 2989 | 5.7% |
| 6 | 3034 | 5.8% |
| 7 | 3055 | 5.8% |
| 8 | 3097 | 5.9% |
| 9 | 3124 | 6.0% |
| 10 | 2986 | 5.7% |
| 11 | 770 | 1.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 30 | 25 | |
| 29 | 21 | |
| 28 | 34 | |
| 27 | 44 | |
| 26 | 26 | |
| 25 | 29 | |
| 24 | 40 | |
| 23 | 32 | |
| 22 | 31 | |
| 21 | 34 |
loan_status
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 45517 | 87.0% |
| 1 | 6801 | 13.0% |
age_of_first_emp
Real number (ℝ)
High correlation 
| Distinct | 59 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.853014 |
| Minimum | 14 |
| Maximum | 82 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 817.5 KiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 17 |
| median | 22 |
| Q3 | 26 |
| 95-th percentile | 36 |
| Maximum | 82 |
| Range | 68 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 6.7447909 |
|---|---|
| Coefficient of variation (CV) | 0.29513791 |
| Kurtosis | 3.686276 |
| Mean | 22.853014 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.497953 |
| Sum | 1195624 |
| Variance | 45.492205 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 11366 | |
| 22 | 3782 | 7.2% |
| 23 | 3600 | 6.9% |
| 21 | 3438 | 6.6% |
| 20 | 3138 | 6.0% |
| 24 | 2886 | 5.5% |
| 19 | 2819 | 5.4% |
| 25 | 2425 | 4.6% |
| 26 | 2222 | 4.2% |
| 18 | 2193 | 4.2% |
| Other values (49) | 14449 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 14 | 42 | 0.1% |
| 15 | 831 | 1.6% |
| 16 | 11366 | |
| 17 | 1463 | 2.8% |
| 18 | 2193 | 4.2% |
| 19 | 2819 | 5.4% |
| 20 | 3138 | 6.0% |
| 21 | 3438 | 6.6% |
| 22 | 3782 | 7.2% |
| 23 | 3600 | 6.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 82 | 1 | < 0.1% |
| 81 | 1 | < 0.1% |
| 73 | 2 | < 0.1% |
| 70 | 6 | |
| 69 | 4 | |
| 68 | 2 | < 0.1% |
| 66 | 4 | |
| 65 | 3 | |
| 64 | 4 | |
| 63 | 3 |
Interactions
Correlations
| | age_of_first_emp | cb_person_cred_hist_length | cb_person_default_on_file | id | loan_amnt | loan_grade | loan_int_rate | loan_intent | loan_percent_income | loan_status | person_age | person_emp_length | person_home_ownership | person_income |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age_of_first_emp | 1.000 | 0.567 | 0.047 | 0.004 | -0.021 | 0.033 | 0.079 | 0.067 | 0.015 | 0.070 | 0.669 | -0.628 | 0.096 | -0.056 |
| cb_person_cred_hist_length | 0.567 | 1.000 | 0.006 | 0.006 | 0.047 | 0.012 | -0.001 | 0.092 | -0.029 | 0.026 | 0.805 | 0.034 | 0.043 | 0.103 |
| cb_person_default_on_file | 0.047 | 0.006 | 1.000 | 0.000 | 0.046 | 0.648 | 0.609 | 0.029 | 0.035 | 0.181 | 0.014 | 0.058 | 0.100 | 0.011 |
| id | 0.004 | 0.006 | 0.000 | 1.000 | 0.008 | 0.011 | 0.004 | 0.003 | 0.011 | 0.020 | 0.006 | 0.003 | 0.000 | -0.006 |
| loan_amnt | -0.021 | 0.047 | 0.046 | 0.008 | 1.000 | 0.072 | 0.071 | 0.034 | 0.719 | 0.132 | 0.061 | 0.092 | 0.069 | 0.368 |
| loan_grade | 0.033 | 0.012 | 0.648 | 0.011 | 0.072 | 1.000 | 0.716 | 0.029 | 0.067 | 0.450 | 0.013 | 0.046 | 0.123 | 0.009 |
| loan_int_rate | 0.079 | -0.001 | 0.609 | 0.004 | 0.071 | 0.716 | 1.000 | 0.027 | 0.136 | 0.399 | -0.001 | -0.114 | 0.130 | -0.087 |
| loan_intent | 0.067 | 0.092 | 0.029 | 0.003 | 0.034 | 0.029 | 0.027 | 1.000 | 0.016 | 0.093 | 0.090 | 0.047 | 0.093 | 0.002 |
| loan_percent_income | 0.015 | -0.029 | 0.035 | 0.011 | 0.719 | 0.067 | 0.136 | 0.016 | 1.000 | 0.419 | -0.048 | -0.062 | 0.093 | -0.327 |
| loan_status | 0.070 | 0.026 | 0.181 | 0.020 | 0.132 | 0.450 | 0.399 | 0.093 | 0.419 | 1.000 | 0.023 | 0.114 | 0.238 | 0.026 |
| person_age | 0.669 | 0.805 | 0.014 | 0.006 | 0.061 | 0.013 | -0.001 | 0.090 | -0.048 | 0.023 | 1.000 | 0.061 | 0.043 | 0.150 |
| person_emp_length | -0.628 | 0.034 | 0.058 | 0.003 | 0.092 | 0.046 | -0.114 | 0.047 | -0.062 | 0.114 | 0.061 | 1.000 | 0.172 | 0.222 |
| person_home_ownership | 0.096 | 0.043 | 0.100 | 0.000 | 0.069 | 0.123 | 0.130 | 0.093 | 0.093 | 0.238 | 0.043 | 0.172 | 1.000 | 0.026 |
| person_income | -0.056 | 0.103 | 0.011 | -0.006 | 0.368 | 0.009 | -0.087 | 0.002 | -0.327 | 0.026 | 0.150 | 0.222 | 0.026 | 1.000 |
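The High correlation alerts can be related back to this matrix. The sketch below flags every pair whose absolute coefficient reaches a cut-off of 0.5. That cut-off is an assumption: the report does not state its threshold, but 0.5 reproduces the nine flagged variables. The helper `high_correlation_partners` is hypothetical and is not part of ydata-profiling.

```python
# Coefficients copied from the upper triangle of the matrix above.
# Only some pairs are listed; every omitted pair has |r| < 0.5 in the matrix.
correlations = {
    ("age_of_first_emp", "cb_person_cred_hist_length"): 0.567,
    ("age_of_first_emp", "person_age"): 0.669,
    ("age_of_first_emp", "person_emp_length"): -0.628,
    ("cb_person_cred_hist_length", "person_age"): 0.805,
    ("cb_person_default_on_file", "loan_grade"): 0.648,
    ("cb_person_default_on_file", "loan_int_rate"): 0.609,
    ("loan_amnt", "loan_percent_income"): 0.719,
    ("loan_grade", "loan_int_rate"): 0.716,
    ("loan_grade", "loan_status"): 0.450,
    ("loan_percent_income", "loan_status"): 0.419,
    ("loan_amnt", "person_income"): 0.368,
}

THRESHOLD = 0.5  # assumed cut-off, not stated in the report


def high_correlation_partners(pairs, threshold):
    """Map each variable to the variables it is strongly correlated with."""
    partners = {}
    for (left, right), coefficient in pairs.items():
        if abs(coefficient) >= threshold:
            partners.setdefault(left, []).append(right)
            partners.setdefault(right, []).append(left)
    return partners


flagged = high_correlation_partners(correlations, THRESHOLD)
for variable in sorted(flagged):
    print(variable, "->", sorted(flagged[variable]))
```

Under this assumption loan_status is not flagged, since its largest coefficient (0.450 with loan_grade) falls below the cut-off.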
Missing values
Sample
First rows
| | id | person_age | person_income | person_home_ownership | person_emp_length | loan_intent | loan_grade | loan_amnt | loan_int_rate | loan_percent_income | cb_person_default_on_file | cb_person_cred_hist_length | loan_status | age_of_first_emp |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 37 | 35000 | RENT | 0.0 | EDUCATION | B | 6000 | 11.49 | 0.17 | N | 14 | 0 | 37.0 |
| 1 | 1 | 22 | 56000 | OWN | 6.0 | MEDICAL | C | 4000 | 13.35 | 0.07 | N | 2 | 0 | 16.0 |
| 2 | 2 | 29 | 28800 | OWN | 8.0 | PERSONAL | A | 6000 | 8.90 | 0.21 | N | 10 | 0 | 21.0 |
| 3 | 3 | 30 | 70000 | RENT | 14.0 | VENTURE | B | 12000 | 11.11 | 0.17 | N | 5 | 0 | 16.0 |
| 4 | 4 | 22 | 60000 | RENT | 2.0 | MEDICAL | A | 6000 | 6.92 | 0.10 | N | 3 | 0 | 20.0 |
| 5 | 5 | 27 | 45000 | RENT | 2.0 | VENTURE | A | 9000 | 8.94 | 0.20 | N | 5 | 0 | 25.0 |
| 6 | 6 | 25 | 45000 | MORTGAGE | 9.0 | EDUCATION | A | 12000 | 6.54 | 0.27 | N | 3 | 0 | 16.0 |
| 8 | 8 | 37 | 69600 | RENT | 11.0 | EDUCATION | D | 5000 | 14.84 | 0.07 | Y | 11 | 0 | 26.0 |
| 9 | 9 | 35 | 110000 | MORTGAGE | 0.0 | DEBTCONSOLIDATION | C | 15000 | 12.98 | 0.14 | Y | 6 | 0 | 35.0 |
| 11 | 11 | 22 | 33000 | RENT | 6.0 | PERSONAL | B | 10000 | 11.12 | 0.30 | N | 2 | 1 | 16.0 |
Last rows
| | id | person_age | person_income | person_home_ownership | person_emp_length | loan_intent | loan_grade | loan_amnt | loan_int_rate | loan_percent_income | cb_person_default_on_file | cb_person_cred_hist_length | loan_status | age_of_first_emp |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 58634 | 58634 | 30 | 85000 | MORTGAGE | 6.0 | PERSONAL | A | 5000 | 7.51 | 0.06 | N | 7 | 0 | 24.0 |
| 58635 | 58635 | 32 | 69000 | RENT | 0.0 | DEBTCONSOLIDATION | B | 12000 | 10.20 | 0.17 | N | 7 | 1 | 32.0 |
| 58636 | 58636 | 24 | 37000 | RENT | 3.0 | EDUCATION | C | 9000 | 13.49 | 0.24 | Y | 2 | 0 | 21.0 |
| 58637 | 58637 | 24 | 75000 | RENT | 8.0 | VENTURE | B | 4000 | 10.75 | 0.05 | N | 4 | 0 | 16.0 |
| 58638 | 58638 | 29 | 46610 | MORTGAGE | 1.0 | PERSONAL | D | 2600 | 17.58 | 0.05 | N | 6 | 1 | 28.0 |
| 58639 | 58639 | 22 | 70000 | RENT | 6.0 | DEBTCONSOLIDATION | A | 10000 | 7.29 | 0.14 | N | 4 | 0 | 16.0 |
| 58641 | 58641 | 28 | 28800 | RENT | 0.0 | MEDICAL | C | 10000 | 12.73 | 0.35 | N | 8 | 1 | 28.0 |
| 58642 | 58642 | 23 | 44000 | RENT | 7.0 | EDUCATION | D | 6800 | 16.00 | 0.15 | N | 2 | 1 | 16.0 |
| 58643 | 58643 | 22 | 30000 | RENT | 2.0 | EDUCATION | A | 5000 | 8.90 | 0.17 | N | 3 | 0 | 20.0 |
| 58644 | 58644 | 31 | 75000 | MORTGAGE | 2.0 | VENTURE | B | 15000 | 11.11 | 0.20 | N | 5 | 0 | 29.0 |
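The sample rows suggest, though the report does not say so, that age_of_first_emp equals person_age minus person_emp_length and that loan_percent_income is loan_amnt divided by person_income, rounded to two decimals. Both relationships would help explain the High correlation alerts. The sketch below checks these assumed relationships against the first five sample rows and against the column sums reported in the Descriptive statistics tables.

```python
# Columns: person_age, person_emp_length, person_income, loan_amnt,
#          loan_percent_income, age_of_first_emp
# Rows copied from the first five sample rows.
sample_rows = [
    (37, 0.0, 35000, 6000, 0.17, 37.0),
    (22, 6.0, 56000, 4000, 0.07, 16.0),
    (29, 8.0, 28800, 6000, 0.21, 21.0),
    (30, 14.0, 70000, 12000, 0.17, 16.0),
    (22, 2.0, 60000, 6000, 0.10, 20.0),
]

# Assumed relationship 1: age_of_first_emp = person_age - person_emp_length.
age_matches = all(
    age - emp_length == first_emp
    for age, emp_length, _, _, _, first_emp in sample_rows
)

# Assumed relationship 2: loan_percent_income is loan_amnt / person_income,
# within the half-unit rounding error of a two-decimal value.
ratio_matches = all(
    abs(loan_amnt / income - pct) < 0.005
    for _, _, income, loan_amnt, pct, _ in sample_rows
)

# Column sums reported for person_age, person_emp_length, age_of_first_emp.
sum_age, sum_emp_length, sum_first_emp = 1440945, 245321, 1195624
sums_match = sum_age - sum_emp_length == sum_first_emp

print(age_matches, ratio_matches, sums_match)
```

Agreement on a handful of rows and on the column sums is consistent with the assumption but does not prove it holds for every record.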